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POLYPEPTIDE HAVING IMMPNQLOGICAL ACTTVTTY FOR USR A.c; 
DIAGNOSTIC REAGENT AND/OR VarrTMTi; 

TECHNICAL FIEI.D 

This invention relates to a polypeptide having 
5 immunological activity for use as a diagnostic reagent 
and/or a vaccine component. 
BACKGRODND ART 

Diagnostic kits for use in screening individuals for 
infection with human immunodeficiency virus (HIV) infection 
10 frequently include reagents comprising HIV antigens which 
are used to detect antibodies using known immunological 
techniques including ELISA, Western Blot, latex 
agglutination and immvmo-luminescent and immxino-fluorescent 
techniques . 

15 The effectiveness of such techniques however depends 
upon selection of suitable immunological reagents and one 
peurticulau: difficulty which 2u:ises is that particular 
reagents are often specific to individual strains or groups 
of strains of HIV. Thus, for example, known diagnostic 

20 reagents based upon HIV-1 may fail to detect antibodies 
resulting from an infection of a patient with Hrv-2. 

Similarly, in the production of vaccines designed to 
protect individuals against HIV infection, the use of 
antigens derived from one particular strain of HlV may fail 

25 to provide adequate protection against infection with other 
strains. 

It is an object of the present invention to overcome 
such problems. 

DISCLOSURE OF INVENTION 

30 It has now been found that the product of expressing a 
substantial part of the HIV-pol gene in a suitable host has 
antigenic properties which allows the above-mentioned 
problems to be overcome. 

Thus according to one aspect of the present invention 

35 there is provided the use as an antigenic reagent in the 
diagnostic test or as a vaccine component of a polypeptide 
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comprising a substamtial portion of each of more than one of 
the constituent proteins coded for by the HIV-pol gene* 

Diagnostic kits and vaccines comprising said polypeptide 
form further aspects of the present invention. 
5 The HTV-pol gene codes for foiir enzymes, namely a 

protease, a reverse transcriptase, a ribonuclease referred 
to as RNAse H and an enzyme referred to as Integrase. 

It is believed that during infection of a T cell by HIV 
a full length precursor is e3q)ressed which is then cut up 
10 into the discrete proteins listed above. These have the 
following activities and (it is thought) act in the order 
indicated: - 

Protease Precursor Cleavage 

Reverse Trsoiscriptase Preparation of viral DNA from viral 
IS RNA 

RNAse H Destruction of viral RNA leaving 

newly synthesised DNA 

Integrase Insertion of said DNA into host cell 

genome 

20 According to a preferred aspect of the present 

invention, said constituent proteins are enzymes coded for 
by the HIV-pol gene and the polypeptide thus comprises a 
substantial portion of each of a plurality of enzymes 
selected from HIV-pol protease, HIV-pol reverse tramscrip- 

25 tase, HIV-pol RNAse H and HIV-pol Integrase • Most prefer- 
aJDly, the polypeptide comprises substantial portions of all 
four of said enzymes. 

In vivo , the initial product of expressing the HIV-pol 
gene is cleaved into its individual elements by the 

30 protease. The active site for proteolytic activity occurs 
adjacent the NH2-*terminu& of the expression product, 
corresponding to the 5^ -end of the protease gene. 
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According to a preferred aspect of the present 
invention, the polypeptide omits at least that part of the 
amino acid sequence of the HIV-pol protease gene which codes 
for the active site responsible for proteolytic activity. By 
5 omitting this portion, the integrity of the polypeptide is 
maintained and it is less liable to degrade. 

ffilEF DESCRIPTTQW ftg nP&WT^r^e 

Figure 1 is a schematic diagram showing the procedure of 

Example 1; 

10 Figure 2 shows the results of electrophoresis tests 
carried out in the manner explained in Example 2; and 

Figure 3 is a graph showing the results of the 
experiments, carried out in Example 3. 

BEST MODE FOR CARR YING OUT THE TWVTCICTIQW 

15 The HIV-pol gene of several strains of HIV-l has been 
cloned and the corresponding amino acid sequences derived 
from the determined DNA sequences. The amino acid sequences 
of ten strains appear in the accompanying Table 1 at. the end 
of this disclosure. In Table 1, the full sequence of strain 

20 HIV HXB2 is given, whereas for the other nine strains, only 
sequence differences are listed. As used herein, the term 
"constituent protein coded for by the HIV-pol gene" refers 
to a protein having sufficient amino acid homology with the 
sequence of HIV HXB2 appearing in the accompanying Tabli so 

25 as to result in antibodies raised against the protein cross- 
reacting with a polypeptide consisting of the precise, amino 
acid sequence of HIV HXB2. 

The HIV-pol gene can be expressed to produce the desired 
polypeptide by various techniques, e.g. some or all of the 

30 baculovirus techniques described in U.S. Patent 4,745,051 to 
Gale E. Smith et al issued on May 17, 1988; Baculovirus 
Vectors for Expression of Foreign Genes by c. Yong Kang, 
Advances in Virus Research, Vol. 35, pp 177-192, Academic 
Press Inc., 1988; A Manual of Methods for Baculovirus 

35 Vectors and Insect Cell Culture Procedures, Max D. Summers 
and Gale E. Smith, May 1987, Texas ASM University; and 
Baculoviruses as Gene Expression Vectors, Lois K. Miller, 
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Ann* Rev, Microbiol. 12.# PP 177-1991; the disclosxxres of 
which are incorporated herein by reference. However our 
Canadian Patent Application Serial No. 591,908 filed on 23rd 
February 1989 (and equivalent British Patent Application 
5 Serial No. 89 04426.7 filed on February 27, 1989 and US 
Patent Application Serial No- 316,768 filed on February 28, 
1989) describes and claims an improved baculovirus 
expression system capable of producing foreign gene proteins 
at high levels and the use of this expression system is 

10 particulaurly preferred for expressing the polypeptide of the 
present invention. 

The process disclosed in our Cemadiem patent employs a 
recombinant baculovirus containing at least a major part of 
a polyhedrin gene promoter region, a transcription 

15 termination sequence of a polyhedrin structural gene, a 
foreign structural gene (e.g. an HIV-pol gene) having a 
translation start codon followed by coding sequences and a 
translation stop codon. The foreign gene is located between 
the promoter region and the termination sequence. 

20 Immediately upstream of the start codon there is a putative 
insect cell ribosome binding site for the polyhedrin gene 
effective for overcoming resistance of susceptible insect 
cells to express the foreign gene at a high level. The 
putative ribosome binding site comprises at least the final 

25 four nucleotides of the sequence 5 * -ACCTATAAAT-3 ^ • 

Example 3 of the Canadian application describes the 
production of the pol protein of HIV-1 in a baculovirus 
expression system based on Autograoha califomica 
nucleopolyhedrosis virus (ACNPV) and specifies that a 

30 recombinant baculovirus designated ACNPV-HIV-YK-pol has been 
deposited at the American Type Culture Collection of 12301 
Parklawn Drive, Rockville MD 20852, USA under Accession No- 
ATCC VR 2233. Deposit was made on November 30, 1988. The 
disclosure of our Canadian Patent Application Serial No. 

35 591,908 is incorporated herein by reference. 

Utilising the procedures described in Example 3 of 
Caoiadian Patent Application Serial No. 591, 908, a polypeptide 



Polj^eptide can be us«^ , 

entiUwj Clinlca, Testis,, , JZ I^bUeation 
. >re.„, ceorge s;.ee" ^' x'" ^ "e^ 

Polypeptide Uckin, the f^ff, "^^^^io" as , f 

to the the'p'ol^!"::- ''^''^ = an. 3 .eX.te 

■liagnostlc reagent. ^n«Pti-Je and Its use as a 



tene 



« Plas.id pH^,B contaiS^ tt.^ f"' '"^ ^"3»ent 
pacis. The resulting ra«Jr!! !^ ^'^ sites or 

.'r^l^ -P^oXXe^'T' ^' - 

•trand«l crossover itajeer c»tai!i: * d^^le- 
Ba«I site, the putative ins^'T! "/"^ ""-1- -a. a 
«U rlboso« binding site ^T.^f^''^ toaiEasfla (sra, 
homology searching sIguenCs ' 1 ^ ™=^'»«^^th. ' 
" !«'»^- the pol gIL 4s iL t r"'^= "^t* the 5. 
transror^d. ^e r..l^^:i ^^^^ " the ssti sit. and 
"olated, digested with sm I T 'P""'-"*"! 2) was 
«lth another crossover liS; ''^'^'^^ "^'ted 

at the termini, Ba^ s^e^ ""-^v. 
35 homology searching segu«,c« w^w ' " """^otlW of the 
« - pol gene, .e r^^C^ZZlZlZZ^- 



PCr/CA90/00062 

WO 90/10230 

6 

(pUC18-Dpol 3) contains the putative SF9 cell ribosome 
binding site (P) followed with pol open reading frame 
starting with the first ATG (TI> codon (map xinit 2357-2359) 
in the pol gene and the translation termination (TT) codon 
5 TAG (map unit 5093-5095) . This whole cassette was flanked 
with BamHl sites. The BamHl fragment was isolated and 
inserted into the BamHl site of the pAcYMl baculovirus 
transfer vector (pAcYMl-Dpol) . The pAcYMl-Dpol transfer 
vector DNA was used to co-transfect SF9 cells with wild type 
10 AcNPV DKA to isolated recombinant AcNPV HIV-YK pol virus. 

BXAMPEE 2 

y ^-r-ofisnoTi of pni nana p r-oduets bv recombinant baculQviruses 
Recombinant ACNPV-HIV«Hpol contains an insert comprising 
essentially the whole DNA sequence of the HIV-pol gene (see 
15 Table 2 at the end of the present disclosure) . When 

expressed, the resulting full length gene product of the 
HIV-pol gene is "processed", i.e. the proteolytic active 
site of the HIV pol protease gene cleaves the protein into 
66 KD, 51 kD and 32 kD fragments. 
20 By way of comparison, recombinant AcSNPV-HIVYKpol (see 
Table 3 at the end of the present disclosure) omits 
NHj-terminal amino acid sequences containing the proteolytic 
active site of the HIV-pol protease. When expressed, the 
resulting gene product is not "processed", i.e. the - 95 kD 
25 protein remains intact. 

The following experiments illustrate this. 
Uninfected fi. fruqioerda (SF9) cells, or.SF9 cell 
infected with recombinant baculoviruses AcNPV-HlVWHpol , 
AcNPV-HIVYKpol or with wild-type AcNPV, were harvested after 
30 72 hours of infection. Lysates of the infected or uninfected 
cells were electrophoresed in a 12% polyacrylamide Laemmli 
gel and proteins are identified by either Coomassie blue 
staining (S) or Western blot analyses (W) using the standard 
HIH HIV positive immunoglobulin. As shown in Figure 2, lanes 
35 1, 2 and 3 represents the lysates of AcNPV-HIVYHpol • 

recombinant virus infected cells, lanes 4, 5 and 6 represent 
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the lysates of AcNPV-HIVYKpol recombinant virus infected 
cells, lane 7 shows the wild-type AcNPV infected cell 
lysate, lane 8 shows uninfected cell lysate and lane 9 shows 
molecular weight markers. Une 3 and 6 show the whole cell 
5 lysate, lanes 2 and 5 show proteins in the infected cell ' 
nuclei and lanes 1 and 4 show proteins in the infected cell 
cytoplasm. P denotes polyhedrin protein and arrows show 95K 
Dal uncleaved pol gene product representing 91 amino acid 
deletion of protease produced by AcNPV-HIVYKpol virus and 
10 66K Dal, 51 K Dal and 33K Dal processed pol gene products in 
AcNPV-HIVYHpol virus infected cells. 



EXAMPLE 3 . 

A. Prod uction of pol gene produrstr 

Recombinant ACNPV-mVYKpol virus infected Spodooter-a 
15 fg\tqA.pey<a^ (SF9) cells were harvested 4 days after ' 

infection. Nuclei of infected cells containing most of the 
pol gene product were isolated by treating the infected 
cells with 0.1% Triton X-100 and 0.5% NP40 on ice for 20 
minutes followed by centrifugation at 750 g for lo minutes. 
20 The pelleted nuclei were denatured with 1% SOS in TRISrHCl 
pH 8.0 at room temperature for 30 minutes. The cellular- DKAs 
were removed by ethanol precipitation using 2 volumes of 
100% ethanol. The SDS in the solution were removed by aM- 
ition of 25 mM KCL incubated at 4'C for 30 minutes: followed ' 
25 by centrifugation at 12,700 g for 15 minutes. The pol gene 
product in the supernatant was used for anti-pol ELISA. 

B. Detection o f HIV antibodies hv ET.Tsa 

The pol antigen was diluted in PBS and dispensed in a 
microtiter plate (Nunc cat 269620). The concentration of pol 
30 to coat plates was determined empirically on the strength of 
bands on polyacrylamide gels. ' 

The concentration of pol necessary to coat one well was 
between 1 and 10 ng. 

The plate was covered and incubated at 4*C. The time of 
35 incubation varied between 12 and 24 hrs without no apparent 
differences in reactivity. 
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The plates were then washed three times in PBS tween 20 
employing a Skatron plate washer. 

Various standards, NIH HIV+ iinmxinoglobulin (NIH STD) , 
pool HIV+ plasma (PAT STD) and plasma from non-infected 
5 individuals (NS) were employed. The standards were diluted 
beginning at 1:200 for NIH STD, and l:io for PAT STD and NS. 
Unknowns were tested usually at 1:50 but dilutions as high 
as 1:10 cah be employed. 

All samples were inactivated before testing. Normal sera 

10 were processed in the same fashion as sera from AIDS 
patients. The inactivation was performed with 
4'-aminpethyltrioxsalen- hydrochloride (AMT) from Lee 
Biomoleculaf Research Inc. (San Diego, California cat 231) 
and an ultra violet light trans-illuminator (Spectroline 

15 model TC-365, Fisher Scientific Ottawa Ont.). The AMT was 
reconstituted in 50% ethanol at 1 ng/ml. The sera was 
aliquoted in Eppendorf tubes and for every 100 ^1 of serum 
or plasma, 10 imI of AMT was added to the sample. The 
samples were layed in the transilluminator and irradiated 

20 for 5 minutes. An additional 10 iil of AMT was added to the 
sample and the samples were irradiated for a further 5 
minutes. The samples were inactivated by this procedure. 

The incubation time of the human-ant i-pol was 30 to 40 
minutes at room temperature (23 "C) (the. time of incubation 

25 found to be quite critical) . Therefore, all dilutions of 
standards (negative and positive) and unknowns was performed 
in a separate plate. Once all dilutions were done, the 
dilutions (100 fil) were transferred to the ELISA plate 
coated with pol employing a multichannel pipettor. All 

30 dilutions were with PBS Tween 20 (0.1%) . 

The state of the serum or plasma seuaple was foxmd to be 
important. Samples repeatedly frozen and thawed usually gave 
higher backgrounds. This was especially evident with samples 
from normal individuals. 

35 The plates were washed three times in PBS -Tween 20 after 
the 30 minute incubation with the first emtibody. 
A Skatron II plate washer was employed for this purpose ♦ 
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The second antibody used (goat anti-human Ig linked to 
horse radish peroxidase) was an affinity purified reagent 
obtained from Tago Diagnostics (Inter Medico To DNT cat 
2393). An appropriate dilution was determined experimen- 
5 tally (approximately 1:2,000) is made in PBS-Tween 20 
(0.1%). 100 fil was dispensed into the wells except for one 
which will be employed as a blank for the plate reader* The 
plate was incubated for 1 hour at room temperature/ 

The plates were washed three times with PBS-Tween 20 
10 employing the Skatron II plate washer. 

Freshly prepared substrate (100 /il) was added to the 
wells and after 20 minutes the reaction stopped with the 
addition of. 100 fil of 0.07M H2SD4, 

The plate was read at 450 nm in the BIOTEK BL/310 ELISA 
15 plate reader. A hard copy of the data was obtained from the 
reader and the data also stored directly onto computer 
diskette for further processing by the Anelisar program. 

Additionally, controls were also performed on each 
plate. In two or three wells no serum or plasma was added. 
20 In one well no primary or secondary antibodies were added 
but substrate was. This well was employed to blank the ELISA 
plate reader. The remaining wells were employed to determine 
the extent of binding of the secondary antibody (Goat 
anti-HIg-HRPO) to POL. Thus, these wells received no primary 
25 antibody but secondary antibody and substrate with the 

appropriate washes in between each incubation. Usually the 
value of this latter control is below 0.1000 OD. 

The results are shown in Figure 3. 

The following materials were use for the anti-pol ELISA 
30 procedure 
Buffers 

Phosphate Buffered Saline fPBS) 

NajHPO^ (dibasic anhydrous) 13.6 g 
NaHjPO^ (monobasic) 2.4 g 

35 NaCl 90.0 g 

Salts are dissolved in 8 litres of distilled deionized 
water and pH is adjusted to 7.2 with NaOH or HCl. This 
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buffer is employed as coating buffer, diluent and washing 
buffer. The latter two buffers are modified as indicated 
below. 

Diluent for primary and sec ondary antibodies and washing 
5 buffer 

PBS + 0.1% Tween 20 (Sigma, St. Loxiis MO) (0.1 ml Tween 
20 + 100 ml PBS) . The diluent buffer is made up daily. 
Substrate buffer 

Equal volumes of O.IM Na2HP04 (0.709 g/50 ml) and O.IM 
10 citric acid (0.960 g/50 ml). The pH is adjusted to 4.0 with 
NaOH or HCl. The substrate buffer is made up weekly, 
pubsty^te 

A tablet (2 mg) of o-phenylenediamine (Sigma cat. P6787) 
is dissolved into 10 ml of substrate buffer. Hydrogen 

15 peroxide (4 jxl of 30%) is added to the solution just prior 
to plating. The solution should be kept in the dark as much 
as possible. 
Stopping reagent 

The enzymatic reaction is stopped with 0.07M H^SO^. 

20 It is a particularly advantageous feature of the 
polypeptides, the use of which is described herein, that 
they cross-react with antibodies against diverse strains of 
HIV. Thus, for example, the polypeptides described herein 
based on HTV-l can cross-react with antibodies raised 

25 against various strains of HIV-1 and HlV-2. Thus they may be 
used in diagnostic kits for detecting either virus category. 
Similarly, in vaccines they can provide broad-spectrum 
protection. 

Industrial Applicability 
30 As will be apparent from the above, the present 

invention can be used in the medical field for testing for 
HIV infection and for immunizing against HIV infection, as 
well as for other diagnostic or prognostic purposes. 
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TABLE 1 



HIV-1 pol protein sequence of HIVHX32 virus 
Data from Hx2xaan Retroviruses and AIDS 1988 
Los Alanos I^ational Laboratory 

AcNPV-'HlVWHpol 

HIVnX32 MetPheP heArgGluAspLeuAXaPhel,euGInSlvLvsAlaAraSluPhaSarSftrt51u ... 19 

HIYBHIBI -fc _ ,^Gin 20 

HIVBH5 Bi„ 20 

HIVPV22 .Gin 28 

KIVBRU 5i„ 20 

HIVMN g 

HIVSF2 ... -jg 

HIVRF - — - — — — _Asn— Pro ■ : Leu 19 

HIVMAL ^ — Asn 

HlVEtl — ■ —— Asn ■ Pro~ -~61y Leu-i~ProLys 19 



HXVHXB2 . . . . : GlnThrArgAloAsnSerProThrApg 

KIVBH1 02 ThF»Af>rjAl«Agr>gar>P.->^Th»TT«gii>^C^,.r7^,, 

ThrArQAXaAsnS»rPf»QThi»T1aS<»><ta..»rtT.| 

ThrArqAlaAsnSaf*PrQThf>TliiSai»jt»»fy-|^f-. 



KIVBH5 
HIYPV22 
KIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIYELI 



-Ser 
-Ser 



28 
40 
40 
40 
40 
0 
28 
28 
28 
23 



HIVHX32 ArgGluLeuGlnVolTrpGlyArgAspAsnAsnSepProSerGluAlaGlyAlaAspArg 

HIVBH102 , 

HIYBHS ■ ■■. 

HIVPV22 

HXVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



-Leu- 



-GlyGlu- 



. . . LysThrLau 
. .ProLeu 




48 
60 
S0 
60 
60 
0 
48 
47 
47 
47 



HIVHXB2 GlnGlyThrValSerPheAsnPhePpoGlnYolThrLeuTppGXnApgPpoLeuVolThr 

HIVBH102 — ixo ; 

HIVBH5 
HIVPV22 
KIVBRU 
HIVMN 
HIYSF2 
HIVRF 
HIVMAL 

HIVELI IIP ■ 



-Il»- 



-Ser- 
-Sep- 



-Ile« 
-Ue- 

-Ue- 
-lie- 
-Il»- 



-He- 
-Vol- 



68 
80 
80 
80 
80 
0 
68 
57 
67 
67 
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Table 1 cont'd 



HIVHXB2 

HivBHiaa 

HIV3H5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

KXVMAL 

HIVEUI 



HIVHXB2 

HXVBH1 02 

KXYBH5 

HIVPV22 

HIVBRU 

HI VMM 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



<*- gag cds end 

IleLysXIeGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAloAspAspThrVal 



-Arg- 



Val- 



ValApgVal- 



LquGIuGI 



[AcNpV-HIVYKpol Starts 

fletSerLeuProGlyArgTrpLysProLysmetlXeGlyGlyXXeGlyGXy 



— Asn- 
—Asn- 
— Asn- 
— IleAsn- 
~Asn- 



-Arg- 



-Lys- 
-Lys- 
-Lys- 



88 
100 
100 
100 
100 
0 
88 
87 
87 
87 



108 
120 
120 
120 
120 
17 
103 
107 
107 
107 



KXVHX32 PheXXeUysVaXArgGXnTyrAspGXnlleLeuXleGIuIleCysGXyHisLysAXalXe 128 

HXYBH102 \ 11^0 

HIVBH5 * 

HXVPV22 . 

HIVBRU ^ 

HXVMN ^Thr Gly 37 

HXVSF2 ProVaX 128 

HIVRF ~ ^ ' 127 

HIVMAL Lys 127 

HIVELI Pro . Gin 127 



HIVHX82 GXyThrVaXLeuVaXGXyPpoThrProValAsnllelXeGXyArgAsnLeuLeuThrGln ' 1 48 
HIVaH102 : 



KIVBH5 



HIVMN 

HIVSF2 

HIVRF 



HIVELI 



160 
1S0 



HIVPV22 150 

HIVBRU 130 



57 
148 
147 



HIVMAL lie Met 147 



147 
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\/ p66, p51 

HIVBH102 ^ ^^^^^^^^'^^'" '' ^"^^"''^^'''' O^^^^g^P^oIleGluThrValProValLYsLeuLys ^ so 
HIYBHS " 

HXVPV22 Z ^ 

HIVSRU _ * ; ''S® 

HIVMN Leu , , - " ~" '^^^ 

HIVSF2 ~ "~ ^ 

HIVRF _j ■ •• ''^S 

HIVMAL • '•^'7 

HIYELI « ""^^ 

— 



HxUBmL ^^^^^y"^^^^PQ^VP^°>-y^V°^^Y«Gl"TrpPr oLeuThrGluGl»Lv^I^ 188 

HIYBH5 --, '^ . ^ ^ ' 

HIVPY22 , 200 

HIVBRU ^ 200 

HIVWN 21 

HIVSF2 •; 

HIVRF ■ " 

HIYMAL Arg " III 

HIYELI 

. 187 



SIIm?B2 '■^"^"^'^^"^^^V^T'^'-g^'^^S^^'-YsSluGl yLysXleSerLysIXe&lYProSXu 208 

HIYBHS 220 

HIYPV22 ZZZ! 

HIVBRU . • -"^ " 220 

HIVMN He 220 

HIVSF2 ^ '^^^ 

HIVRF ' 2^8 

UTVMAr Tw- i-^*-- *" ~ — 207 



207 



207 



SS02 ^^"^^^'^y^^^"'^^^P'"^^°^^>^«^^Q^^«LvsLYSLYs AspSerThrLysTrpAr^ 228 

HIYBHS — - " 2t^Q 

HIYPV22 , _ - ■ 2<i0 

HIVBRU — 2<»0 

HIVMN " 2<»D 

HIVSF2 zzzmiiiiizz"' 

HIVRF . JZ 228 

HIVMAL ~— ^ 227 

HIYELI He m 227 

. -i-Ae — _ 227 
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Table 1 cont»d 



HIVHXB2 LeuValAspPheArgGluLeuAsnLysApgThrGlnAspPheTrpGluValGlnLeuGXy 2«*8 

HIVBH102 ' • 250 

HIVBH5 Apg 260 

HIVPV22 — 260 

HIVSRU 260 

HIVPW Lys 157 

HIVSF2 248 

HIVRF 247 

HIVmAL ~-~-Asn— — =■ 247 

HIVSLI 247 

HXVHX32 'ZleProHisProAlaGlyLeuLysLysLysLysSerValThrValLeuAspValGIyAsp 268 

HIVBH132 — 280 

HIVBHS 280 

HIYPV22 280 

HIVBRU 280 

HIVMN 177 

KIYSF2 — ^ 268 

HIVRF — • 257 

HIVMAL 257 

HIVELI ~- 267 

HIVHXB2 AlaTyrPheSerVolPpoLeuAspGluAspPheArgLySTyrThrAlaPheThrllePro 268 

HIVBH102 300 

HIVBH5 300 

HIVPV22 300 

HIVBRU 300 

HIVRF LysGlu 287 

HIVMAL 287 

HIVSLI — — ~ ^-Ser 287 

HIVHXB2 SerlleAsnAsnGluThrPpoGlylleArgTypGlnTypAsnValLeuProGlnGlyTrp 308 

HIVBm02 ■ ■ , ■ ■ 320 

HIVBHS SepGly 320 

HIVPV22 320 

HIVBRU 320 

HIVMN — 217 

HIVSF2 : 308 

HIVRF A pg 307 

HIVMAL — 307 

HIVELI 307 
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Table 1 conf d 



KIVHXB2 LysGlySerProAloIlePhoGlnSerSerMetThrLysIloLouGluProPheArgLys 
HIVBHl 02 ——■ III M„ — ■ ^^Mi^M^iM 

HIYBHS 

HIVPY22 ' ^ 

HIVBRU ■■ 

HIVMN 1 

HIVSF2 ^ 

HIVRF 



323 

Lys 340 

340 

340 

340 

237 

328 



HIVMAL " '-^"-7 III 

HIVELI 327 

327 



HIVBHS 
HZVPV22 
HZVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL Lys- 



HIVELI GluMet- 



348 
360 
360 
360 
360 
257 
348 
347 
347 
347 



HIVHXB2 GluIleGlyGXnHisArgThPLysIleGluGluLeuArgGlnHisLeuLeuArgTrpGly 

KIVBH1 02 ~— — r-i » »M »ji_ -n M Mil ■■ iM — — - 

HIVBHS " 

HIVPV22 

HIVBRU < 2 ^ 

HIVMN Alo Arg Z-II I 277 

HIVSF2 1__ tiL 

Sal Glu— .Lysll— 36? 

HIVMAL ^ ^ 

HIVELI Lys Glu 1 357 



368 
380 
380 
380 

300 



HIVHXB2 LeuThrThPPpoAspLysLysHisGlnLysGluProPpoPheLeuTrpMetBlyTyrGlu 

HIVSH102 I m^^mt^^m^^^, ■ » ,■■ 

HIVBHS Phe . 

HIVPV22 " 

HIVBRU ~— - 

HIVMN Phe 

HIVSF2 Phe l^J^ 

HIVRF Phe . 

HIVMAL Phe ~ - , 2.- I 

HIVELI Phe Apg 1 



388 
400 
400 
400 
400 



387 
387 
387 
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Table 1 cont'd 



HIVHXS2 LeuHisProAspLysTrpThpValGlnPpoIleValLeuProGluLysAspSerTrpThr 408 

HIVBHIfla ——————————— — ■ 420 

HIVBHS — — -u — — He— ~ ~— — 420 

HIVPV22 420 

KIVBRU 420 

HIVMN 317 

KIVSF2 — — ■ Met 408 

HIVRF 407 

HIVMAL 61n Asp 61u 407 

HIVELI ' — Ser— Lys— — — Glw — — 407 

HIVHXB2 ValAsnAspIleGlnLysLeuValGlyLysLeuAsnTrpAlaSerGlnlleTyrProGly 428 

KZVaH102 440 

HIVBHS ~ ^' 440 

HIVPV22 440 

HIVBRU — — 440 

HIVRF — ^ Ala 427 

HIVWAL — \ 427 

HIVELI Asn GluArg 427 

HIVHXB2 IleLysValArgGlnLeuCysLysLeuLeuArgGlyThrLysAlaLeuThrGluVolZle 448 

HIVBH102 • 4S0 

HIVBHS 4S0 

HIVPV22 - ■ ■ ' ■ ■■ ■ 460 

HIVBRU — 450 

HIVMN Lys— 357 

HIVSF2 Lys 448 

HIVMAL ———Lys ■■ ■ — ■ ■■■■■ ■■ AlQ AspIloVal 447 

HIVELI — . 447 

HIVHXB2 ProLeuThrGluGluAlaGluLeuGluLeuAloGluAsnArgGluIleLeuLysGluPro 483 

HIVBHn02 . : 480 

HIVPV22 480 

KIVPflN — — • 377 

HIVRF cm Lys 4S7 

HIVMAL AlP 4S7 

HIVELI 457 
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table 1 cont'd 



HIVHXB2 ValHisGlyVolTyrTypAspProSerLysAspLeuIleAloGluIleGlnLysGlnGly 



HIVBH5 

HIVPV22 

HXVBRU 



HIVRF 
HIVMAL 
HIVELI 



HIVBK5 
HIVPV22 
HXVBRU 
HXVmN 
HIVSF2 
HIVRF 
HIVMAL 



HIVBH5 
HIVPV22 



HIVBRU -~-Thr 

HIVPIN 



HIVRF 



HIVMAL IleLysSer- 

HIVELI 



HIVHXB2 

HIVBHia2 

HIVBH5 

HIVPV22 

HIVBRU 



LysIleThrThrGluSerlloValllaTrpGlyLysThrProLysPheLysLeuPpoIIe 



HIVSF2 ValSer 

HIVRF VolAlo 



HIVELI Arg^-Ser Arg Arg- 



ii88 
500 
500 
500 
500 





HIVSF2 Glu Vol 



486 

487 
487 



®^1®^y^^"'^'"PT'»»'-Tv63.nIloTyrGln61uProPheLvsAsnLouLysThrGlyLys 503 



520 
520 
520 
520 
417 
503 
507 



SXnTyr ■■ 537 

HIVELI His [ III 

HIVHXB2 TyrAlQArgMetArgClyAlaHisThrAsaAspVolLysGlnLeuThrGluAlaValGln 528 

HIVBKl 02 — ™™™ ™™ ■ ■ T I M M» M^. rT — T-|TM»1 _ 



540 
540 
540 
540 



HIVSF2 ^ ^1 : 

^ . 528 

• 527 



527 

-Ala 527 



548 
560 
560 

580 
580 



HIVMN ——-Ala . •• 

*° — ^ ^Arg r 457 



548 
547 



HIVMAL. AloGln , 

547 
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Table 1 cont'd 



HIVHX32 SlnLysGluThrTrpGluThrTrpTrpThrGluTypTrpGlnAlaThrTrpIleProGlu 568 

HIVBHia2 ' 

HIVBH5 — ~ . . . - 

HIVPY22 

HIVBRU . 



HIVMN Thr+++- 

HIVSF2- Alo Met 

HIVRF Ala 

HIVMAL Ala 

HIVSLI ^Alo 



HZVBK5 

HIVPV22 

HIVBRU 



HIVMN Val- 

HIVSF2 — 

HIVRF 



HIVMAL ^Thr- 

HIVELI , 



HIVBH5 

HXVPY22 

HIVBRU 

HIVMN — — 

HIVSF2 

HIVRF He 

HIVMAL 




580 
580 
580 
580 
4T7 
568 
567 
557 
5S7 



HIVHXB2 -TrpGluPheVolAsnThrProProLeuYalLysLeuTrpTyrGlnUeuGluLvsGluPro 588 
HIVBH102 . . ■ 



600 
600 
S00 
600 
497 
588 
587 
587 
587 



HIVHXB2 lleValGlyAlaGluThrPheTyrValAspGlyAloAlaAsnArgGluThrLysLeuGly 603 
HIV8H102 — . — — ■■■ ■ ■ ., , 020 

— Ser 520 



— — Arg 520 

Sei 620 

— Lys 517 

608 

607 



— — — tys 

HIVELX —He : 1 

HIVHXS2 LysAlQGlyTyrVolThrAsnApgGlyApgGlnLysValValThPLeuThrAspThpThr 623 

HIVBH102 Lys Pro Asn 640 

HIVBH5 ^ g 0 

HIVPV22 Leu Lys Pro Asn 640 

HIVBRU 

HIVMN . 

HIVSF2 Asp 

HIVRF Asp 

HIVMAL Asp. 

.HIYELI Asp 



628 
527 
627 
627 
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Table 1 cont'd 



HIVHXB2 AsnGlnLysThrGluLeuGlnAlaXloTyrLeuAlaLouGlnAspSorGIyLeuGluVal 648 



HIVBHl 02 

HIVBH5 ^ His- 

HIVPV22 — ^ 



HIVBRU — — — — ^ His 

HIVMN His 



HIYRF 



-His Ser— 

HIVELI ^ Asn 



HIVELI 



HIVBH5 



HIVBRU 



HIVSF2 

HIVRF 

HIVMAL Sep 

HIVELI 



660 
680 
660 
660 



HIVSF2 His 



HIVMAL — Ztl' 'Z ®" 



647 
647 



HIVBH 1 02 ■■ — ■ — — 

HIVBH5 , ^ Lys— > 680 

HIVPV22 " 'r^^ 

HIVBRU ]~ 

HivmN , 1:^^ 

HIVSF2 ~ — U ys 577 

HlVRF ~I I , 

HIVMAL — — — ZH ~ 

HIVELI — , 

Lys 667 

HiUbH^L ^^"^^^°^"^^"^°^^^"g^"^^^^^«g^"SXnLeuII eLysLYSGXuLysVQlTyrLeu 6B8 

HIVBHS . ~ '^^^ 

^ 700 



-Gin— — 7fl0 



HIVPV22 

HIVBRU — 

HIVMN sei 1" * 



668 
887 



HIVSF2 Ser ■ , 

HIVRF Ser ^. 

HIVMAL Ho — ri« 



687 



SJSm?2 ^^°^^''^°^^^°^^°»^^^V^'^^y^^^G^Y51y^:^n5XuGlnVoXA^ 708 



720 



HIVPV22 -~- " ' 



720 



HIVMN • * ~ ^ "^2® 



617 

~ 708 

-Arg — .- ■ ■ 7B7 

r-i 707 

707 
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Table 1 cont^d 



HIVHXB2 AlaGIyXXeArgLysValLeuPheLeuAspGlylleAspLysAXaGlnAspGluHisGIu 728 

HIVBK102 ! He— ~~ ■ 7t^d 

HIVBH5 lie Glu 740 

HIVPV22 — ~ Ile~- 7i>0 

HIVBRU ; 740 

HIVMN GluAsp 637 

HIYSF2 Asn Glu 728 

HIVRF Thi 727 

HIVMAL Ser ^-Glu 727 

HIVELI Gin Glu 727 

HIVHXB2 LysTyrHlsSarAsnTrpArgAlanetAlaSerAspPheAsnLeuProProValValAla 748 

HIVBH102 ■ -■ 7S0 

KIYBH5 : 7S0 

HIVPV22 760 

HIVBRU ■ ■■ 760 

HIVMN He 657 

HIVSF2 — 748 

HIVRF 747 

KIVMAL He 747 

HIVELZ — Asn ■ .^i, — — 7^7 

HZVHXB2 LysGluIleValAlaSerCysAspLysCysGlnLeuLysGlyGluAlaMetHisGlyGln 768 

HIVBH102 780 

HIVBH5 780 

HIVPV22 : 733 

HIVBRU ' 780 

HIVWN ^ — _ 677 

HIVSF2 768 

HIVRF 757 

HIVMAL 767 

HIVELI 757 

HIVHXa2 ValAspCysSerProGlylleTrpGlnLeuAspCysThrHisLeuGluGlyLysVallle 788 

HIVBH102 800 

HIVBH5 — : 

HIVPV22 800 

HIVBRU 800 

HIVMN 597 

HIVSF2 He 788 

HIVRF He 787 

HIVMAL He 787 

HIVELI 787 
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Table 1 cont'd 



wJam M '•°"^°^^°^°^.""^°^^°SerGl YTYrIleGIuAloGlaVoXlleProAloGl»Thr 808 



HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HXVSr2 ' 

HIVRF 



620 
320 
820 
820 
717 
808 

HIVMAL lie . _7 

HIVELI — : 

807 

HIVBH5 " ~" 

HIVPY22 
HIVBRU 
HIYMN 



8i»0 
840 
840 



HIVSF2 ^ ^ "^^"^ 

HIVRF He " " ^® 

XX© y^y^ — 



HIVMAL xXe 

HIVELI - * o-:/ 

VaXVal 827 



HIVHXB2 HisThrAspAsnGlySerAsnPheThrGlyAloThryolArgAIaAloCysTrpTrpAlQ 8^8 



HIVBH102 ser 

HIVBH5 .^Ser 

HIVPV22 — -Ser 



-Lys — aso 

-Lys 



HIVBRU serlhr "~ ^ 

Hivm Pro serlhr Lys I^I yll 

HIVSF2 33,^,, 2a 

HIYRF Sei-Thr- i ^ 

HIVELI Ser-Alo^-Lls ~ Z 

S?02 ^^y^^^^y^°^"^^"''>^^ g^y^^«P^^TyrAsnPrDGInSe rGlnGlyVaIVoiGIuSer 863 

HIVBHS ^ ' ~ " ^ 

HXVPV22 ""^^ 

HIVBRU 030 

HIVMN 

HIVSr2 . 777 

HIVRF • — 8S8 

HIVMAL Asn _ " ^ 

HIVELI • 867 

857 



SUBSTITUTE SHEET 



wo 90/10230 



PCr/C:A90/00062 



22 

Table X cont'd 



HIVHXB2 MetAsnLYsGluLsuUYsLYsIlelleGlvGlnVaXArgAspGlnAlaGXuHisLeuLys 888 

HIVBHia2 9«« 

HIVBH5 

HIVPV22 900 

HIVBRU 900 

HXVMN — ~ — 797 

HIVSF2 Asn 888. 

HIVMAL GXu 887 

HXVELI 8S7 

HXVHXB2 ThrAXaVaXGLnWatAXoVoXPhelXeHisAsnPhoLysArgLysGXyGlylXoGXyGXy 908 

HIVBHia2 923 

HIVBH5 ^ 920 

HIVPV22 920 

HXVBRU 920 

HIVIVIN Arg 817 

HIVSF2 ' ' 908 

HXVRF — — ; 907 

HIVMAL ' 907 

HIVEUI ArgArg 907 

HXVHXB2 TyrSerAXoGXyGXuArgXXeVQXAspXXeXXeAXaThrAspZleGXnThrLysGXuLeu 923 

HIVBH102 9<i0 

HIVBH5 9<»0 

HIVPV22 9*»fl 

HIVBRU — 9*»0 

HIVIW Gly 837 

HIVSF2 : 923 

HIYRF 927 

HIVMAL IXo Mot ' 927 

HIVELI IXe ^ ^ 927 



HIVHXB2 GXnLysGXnlXeThrLysIXeGlnAsnPheApgVaXTypTyrArgAspSerArgAsnSer 9<f8 

HIVBH102 Pro 960 

H1V8H5 : Pro 960 

HIVPV22 Pro 960 

HIVBRU AspPro 960 

HIVMN AspPro 857 

HIVSF2 — — — — — AsnLysAspPro 9^*8 

HXVRF AspPro 947 

HIVMAL ~ Asn AspPro 947 

HIVELI IXe AspPro 947 
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Table 1 cont'd 



HIYHXB2 LeuTrpLysGlyPpoAlaLysLeuLeuTrpLysGiyGluGlYAlaVaXVallleGlnAso 

HIvaH102 « : 1 

HIV8H5 ^ _ 

HIVPY22 ^ I , 

HIVBRU " 

HIVMN ; 

HIVSF2 — " 



HIVRF His- 

HIVMAL II© 

HIVELI lie 



963 
960 
980 
980 
980 
877 
968 
967 
967 
967 



sor eds Tstart -> 

HIVHXB2 AsnSepAspIleLysValValPpoArgArgLysAloLysnoHeArgAs^TyrGlYLys 988 



HIVBH102 
HIVBH5 
HIVPV22 
HIVBRU 



1000 
1000 
1000 
1000 



HIVMN — Asn « Vox 897 

HIVSF2 — 



HIVRF 
HIVMAL 



HIVELI Lys val- 



988 
987 
987 
987 



HIVHX32 GlnMetAlaGlyAspAspCysValAlaSepArgGlnAspGluAsp+++ 1 004 

HIVBH102 -^.- ' -i — . M »i. — — , 1016 

HIVBH5 

HIVPV22 TBI 6 

HIVBRU ^ 

HIVMN ThP g,, 

H1VSF2 . 

HIVRF 

HIVMAL — GXyGly — — . 1003 

HIVELI -~ 
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TABLE 2 

HIV-l pol gene HXVHXB2 Sequence " 
Data from Human Retroviruses and AIDS, 1988 
Los Alamos National Laboratory 

AcJIPV-HIVWtlDol Virus 

" . RB5 

Bam HI ti 
5 ' -GGATCCTATAAATATG tttttta gggaagatct 
pol cds start (NH2*tQrminus uncertoin) •> 

2101 ggaettceta caagggaagg ccagggaatt ttetteagag cagoeeogog ecoocagccc 

2161 eaeeagaaga gagettcogg tetggggtog agacaaeqae teeeeeteag aogcoggoge 

2221 egatogaeoa ggaaetgtat eetttooett ceeteaggte aetetttgge oacgaeceet 

22S1 egtcocooTA Aogotagggg ggcaactooa ggaageteta ^ttogataeog gagcagatga 
<- gag cds end 

2541 taeagtattot gaagaaatga gtttgceagg aagatggaaa eeaaaaotga tagggggaat 

2<»01 tggaggtttt ateaoagtoa gaeagtotga teogatacte otagooatct gtggaeataa 

2<»61 agctataggt ceagtattag taggaeetae aectgtcaoe ataottggaa goootctgtt 

2521 gaeteagatt ggttgeoett taaottttee eattagecet attgogaetg toeeagtooa 

2531 attaaagcea ggoatggotg geeeaaoagt toaaeoatgg eeattgaeag aogooaaaat 

2S'»1 aaaageatto gtogaaattt gtacagagat ggoaaoggaa gggaaaattt coaoaattgg 

2701 gcctgaaaat ccatocaata etccagtott tgccatoaag aaoaaagaaa gtoctoaatg 

2781 gogaaaatto gtagotttca gagoacttoa toagagoact caogocttct gggoagttca 

2821 ottoggooto ceaeatcceg cogggttaoa aaagaaaooa teagtaocag tactggotgt 

2881 gggtgotgca tattttteog tteeettaga tgaogoette aggoogtota ctgeatttae 

2941 eatacetagt atooacaatg agaeaeeagg gattagotot cagtocaatg tgcttecoea 

3001 gggotggaaa ggateaceag caatattcea aagtogcotg oeoaaoatct tagagccttt 

3081 togaaaoeaa aatecagaca togttateta teaataeatg gatgatttgt atgtoggatc 

3121 tgacttogoo atagggcagc otagoocooa aatagoggog ctgagocoac atctgttgag a 

3181 gtggggoett occQcaeeag ocoaoaaaca teagaaogoa eetccottec tttggatggg 

3241 ttatgaaete eoteptgata oatggaeogt oeogeetoto gtgctgeeog oooaagacog ' 

3301 etggoetgte aotgoeatae ogoogttogt ggggaoattg aattgggcoa gtcagattto 

3361 eecogggatt aaagtaoggc oottotgtaa actecttaga ggaaccoaog cactoocaga 

3421 ogtaatacco ctaocogaog aagcogogct ogoactggca goaaocogag agattctaoo 
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Table 2 cont'd 





agaaceagta ea'tggagtgt 


attatgoece 


oteaoaogac 


ftOQtogcQg 


QQQ^acaana 


3541 


gcaggggcaa ggceaatgga 


eatateaaot 


'tto'tcaagag 






3601 


aggaaaotat gcaagaatga 




tlMW ^bUU b 


y wuuuiSwau w 


teocagogge 


3661 


og^gcoaaoa ataoecaeog 






Qogoe^ccto 


aatttaaoet 


3721 


gccco-tioeaa QoggooaeolL 




9^99^*<*<3gQg 


ta't'tggectQg 


ecacetggat 


3781 


tcctaaatoo oaot'ttfit'fen 




cttogtgoaa 


ttatggtace 


ogt'tagagaa 


3841 


aaoQCcco^Q a^aaoaoeoa 




^g^agacggg 


gcage^aaea 


gggogQctoa 


3901 




to^acMZw wag 


oggoagaeoa 


aaagftgtca 


ecctoQc^ga 


3961 


cocQQCQQCi^ cogQagoe^g 






gctttgcagg 


attcgggatt 


4021 


OQOaaliQaac atnataoeon 


uw^cucaa wu 


tgcQ't'togga 


Qtcat'tcaag 


cacaaceaga 


4081 






□Qt^gogeog 


t.toot;aaaoa 


aggaaaaggt 


4141 


V w WW *y wjjgjjl «*kJ%»%aiWU 


eawuvsyiiiugg 


oottggogga 


oatgoacaag 


tagataaat^t 


4201 




aagbDCxabb 


t^tago'tgga 


a^ogo'taagg 


cceaagatga 


4261 




B^^ggagage 


aotggetegt 


gattttaaee 


tgeeaeetgt 


4321 


ogtogeaaoa goaatagtag 




bBau^gbCog 


c taaoQ gga g 


oogecqtgca 


4381 


'tggacaagta gactgtogtc 


eaaaaa^at.o 

• JJ 3j *»M 




Xg^QCQCQC^ 


'tagaaggoao 


4441 


ogttatectg g-tagcag-ttic 


atgtagccog 


•*JJJ** 




•»bQx.^ccogc 


4501 


agaaaeoggg caggaoacag 


catattttct 


tt t; 0 0 Q □ "tta 




ggccogtooQ 


4561 


oacaotacQt actgacoatg 


geogcaottt 


caecggtgct 


acggttaggg 


ccgeetgttg 


4621 


gtgggcggga ateaogcagg 


oatttggaat 


'teectocaa't 


eeeeaoogte 


Qoggagtagt 


4681 


ogaotetatg aatoaagao't 


laaagoaaat 




9^aagagQi^c 


oggetgooea 


4741 


tettaogoca geagtacaaa 


tggcogtatt 


eotccacaot 


tttQoaagoa 


aaggggggat 


4801 


tggggggtae agtgeogggg 


ooagaatogt 


ogacQtoata 


gcQQcogacQ 


taeoaactoa 


4861 


agaottaeaa aaocQoo'tta 


eooaaattCQ 


aoottttcgg 


gtttottacA 


Gggocogcag 
A 3'sj. 


4921 


ooat^coctt tggaoaggae 


eageoaogct 


eetctggaaa 
5'sj 


gGTgaagggg 
/\ 


cog tog toot 


4981 


oeaoga^aat ctgtgoeataa 


oagtagtgce 


aagaogoQQa 


gcaaagatca 
sor 23 kO i 


ttagggottA 
cds start 



TGBoocwcog atggcoggtg otgottgtgt ggcoogtogg coggotgogg otTtgi^ -n. 

Crossover <- poi e„tf 
linker Mquence 
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TABLE 3 

HIV-l pol gene 
HIVHXB2 Sequence 

Data from Human Retroviruses and AIDS/ 1988 
Los Alamos National Laboratory 

AcNPV^HIvwilDol Virus 

— RB5 

Baa HI ti (Acnpv^hivwUdoI start) 

5 • •GGATCCTATAAATrtTG tttttta gggaagatst 

pol etfs start (NH2-terminus uneertoin) -> 

2101 ggcettccta eaagggoogg eeagggaott ttettcogag eagoeeagag ecoocagccc 

21^1 caceagoaga gagcttcogg tctggggtag agacaocQoe receeetcog oagcaggogc 

2221 egatogacQQ ggooetgtat cstttaactt ecctcaggtc octctttggc aacgocccct 

2281 cgteaeaaTA Aogatagggg ggeooetoaa ggaogctcta ttegotocag gageogatga 

AcNP\y-HIVYKpol 3am KI BBS ti 

Virus S'-GGATCCrrATAAATMG Cti; AcNPV-HTVYKpol Start) 

2341 taeagtattOi googaootgo gtttgccogg aagotggaaa ecaaooatgo tagggggaot 
2401 tggoggtttt atsaoogtaa gacogtotga tcogatoete ategooatet gtggoeatoa 
24S1 agetotaggt acagtattog toggacetoe aeetgteaae a:taattggaa gaaatctgtt 
2S21 gaeteagatt ggttgeoett taaottttec eottageeet attgagaetg toeeagtoao 
2581 Qttaeiageea ggoatggatg geeeaaaogt toooeaotgg eeattgoeag aogaaaotiQt 
2641 oaaogcatta gtogaoottt gtaeogagat ggaoaaggaa gggaooottt coaaaattgg 
2701 gcctgaaoat eeatacaoto ctccogtott tgccotaaog oaoaaagaca gtactaaatg 
2781 gogaaaatta gtogatttca gagoocttaa taogogaoct caagacttct gggaagttco 
2821 attoggaata ccaeatescg cogggttooa aaagaaaoao tsogtaaeag tactggatgt 
2881 gggtgotgca tottttteog ttseettoga tgaagaettc aggoagtato ctgeatttac 
2941 cotacetogt otaoaeaotg ogaeaeeagg gattagatat eagtaeaatg tgetteeaea 
3001 gggatggaao ggateeeeag eaatatteea aagtageatg oeaaaaatct tagogeettt 
3051 tagaaoQcaa aotceagaea tagttoteta tcoataeatg gotgatttgt atgtaggotc 
3121 tgaettagaa atagggeagc atagaaeaaa aotqgaggag ctgagacaac otctgttgog 
3181 gtggggoctt aceoeaeeag aeaaaoaasa tcogaaogoa cetccottcc tttggatggg 
3241 ttotgaaete eoteetgato aatggaeagt aeageetata gtgetgeeag eaaaagacog 
3301 etggactgte aatgaeatae egaagttogt ggggoaattg oattgggeoa gtcagattta 
33S1 eceagggatt aoagtaagge oattatgtaa aeteettaga ggaaecaaag eactaaeaga 
3421 agtaotaccQ ctaacogaag oageagaget agaoetggea gaaaaeagag agattetaao 
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Table 3 eon«'d 



3481 ogooceogta cotagogtgt attatg teaoaogac ttaotogeog oaotaoagw 

3541 gcoggggcoQ ggccootggo eototeoaat ttateaag tttoaao atstgoaoac 

3691 oggcoaatat gcoBgootga ggggtgeeea eactoatgot gtaaoacool: taacagaggc 
36S1 ogtgcaoQoa otaoceocog aoagcatogt oatatgggga aogactecto oatttaaact 
3721 gcccatacaa ooggooacot gggaaacotg gtggacagag tattggcaag ccocrtggat 
3781 tcctgogtgg gagtttgtta ataoccctcc cttagtgaaa ttatggtacc agttagagao 
3841 ogaoeccata gtaggogcag oaaccttcta tgtagatggg gcogataaca gggagaetaa 
3901 ottaggoaaa gcaggatatg ttactaatog aggoagacao oaagtttftca ccctaoctga 
39B1 caeaocaaot eogaagoetg agttacaage aatttateta gctttgeagg attcgg^att 
4021 agaagtaaao atagtaacag octoacaata tgcattagga atcattcaag cacaocpago 
4081 tcaaagtgaa taogagttag tcaatcaaat aatagagcag ttaataaaao aggaaaaggt 
4141 ctatetggca tgggtaceag eacocaaogg aattggogga aotgoacaag tagotaaatt 
4201 ogtcogtgct ggaateogga oagtoetott tttagatgga otagataagg ecoaogatga 
42S1 aoatgagaaa tatcacagta attggagagc aatggctagt gattttaacc tgccacetgt 
4321 ogtogcaaaa gaaatagtog ecagctgtga taaotgteog ctoaaaggag aagceatgca 
*381 tggoBaagto gactgtagte eoggaatatg geaactagat tgtocoaatt tagaoggaaa 
4441 agttatcctg gtagcagttc atgtagccag tggatatata gaagcogaog ttattccagc 
4S01 agoaacaggg coggaaacog cotattttct tttaaoatto gcaggaagat ggccagtaaa 
4561 aacaatacot actgacaatg gcogeaattt cacaggtgct acggttaggg ccgcctgttg 
ft621 gtgggeggga atcaogcogg oatttggaot tccctacaot ccccoaagte aaggagtagt 
4681 agoatctatg aataaagaat toaogaaaat tataggacag gtoagagotc aggctgaaco 
4741 tcttaagoca gcagtacaoa tggoagtott catccacaat tttaeaagaa aaggggggtft 
4801 tggggggtac ogtgeagggg oaagaotagt agaeataata gaaacagaca tocaaactaa 
*861 agaottocao ooacaaatto eoaoootteo aoattttegg gtttottacA: Gggacagcag 

4921 aaattcaett tggaaaggae eageaaaget cctetggaaa gGTgaagggg cogtagtaat 
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Table 3 cont'd 



4981 Qcaogotaat ogtgacotaa aag'tagtgee aagaagoooa gcaoagoteo ^tegggcrttA 

Bam HI 

S041 TGgoaoacag a^ggcaggtg otgottgtgt ggeaogtaga caggatgagg otTAGGATCC 

r 

GCATG-3 
pol end 

5101 ggaaoagttt agtaoaaeae eatotgtatg ttteagggaa ogetogggga tggttttato 

5151 gaeateocta tgaoogeeet eateeaogaa taagtteago agtoeoeote ceaetagggg 

5221 otgctagatt ggtaataaea acatottggg gtetgcatoe oggagaaaga gae^ggeatt: 

5281 tgggtcaggg agtctccata gaatggagga oaoagagata tagcacoeaa g-tagaccctg 

5341 OQCtogeaga ccaoetsatt eatctgto't't octttgactg tttttcAGac ^cbgcta-taa 

A 3'5j 

5401 gaaaggeett ottaggacae atog'ttogce etaggtgtgo ototcaogca ggocateaea 

5461 ogGToggote tctaeootae Uggcactag cogcattaat ooeoeeaoaa aagatoaoge 
5'sj A 

5521 caeetttgce togtgttaeg aaoctgacog oggatagATG goocaagcee eogaogacea 

R art cds start -> 

5581 agggeeaeag agggogeeoc aeaatgaotg gacacTAGag ettttagagg ogettoagao 

<« sor 23 kD cds end 

5641 tgaagctgtt agacotttte etoggatttg gctccatgge ttagggcooc atatctotga 

5701 oaettatggg gatacttggg caggagtgga agceatoota agaottctgc aacaactgct 

5761 gtttotscot tttcAGaatt gggtgtegac aTAGcagaat oggegttac^ cgocogagga 
A 3'sj c- R orf cds end 

5821 gogcaagoaA TGgogccagt ogatcctaga etagogecct ggaogeatcc aggaogtcog 
tat cds start 

5681 eetaaaactg ettgtaeeao ttgetattgt aaaqagtgtt gettteottg eeaagtttgt 

5941 t-teataaeaa oagcettagg cateteetAT GgcAGgoaga ogeggagaea gcgacgaaga 
trs/art cds start -> /\ 3'sj 

6001 geteateaga eeagteagae teatcaogct tctctatcaa ogcoGTaagt egtaeatgta 

(tat, trs/art, 27 kD) S'sj /\ 

6061 AeGeaaeeta taecaatagt agcaatagta geattagtag tegcaataat aatogcaata 
U arf -> 

6121 gttgtgtggt ecatagtaat eatagaatat aggaaaatat taagacaaag aaaaatagac 
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OAXMS: 

1- The use of a polypeptide as a reagent in a 

diagnostic test for HIV infection or as a vaccine against 
HIV infection characterized in that said polypeptide 
5 comprises a substantial portion of each of more than one of 
the enzymes coded for by the HIV-poi gene. 

2. The use claimed in Claim i characterized in that 
said polypeptide comprises a plurality of enzymes selected 
from HIV-pol protease, HIV-pol reverse transcriptase, 

10 HIV-pol RNAse H and HIV-pol Integrase. 

3. The use claimed in Claim 2 characterized in that 
said polypeptide comprises substantial portions of all four 
of said enzymes. 

4. The use claimed in Claim 2 characterized in that 

15 said polypeptide omits at least that part of the amino acid 
sequence of the HIV-pol protease gene which codes for the 
active site responsible for proteolytic activity. 

5. The use claimed in Claim 3 characterized in that said 
polypeptide omits at least that part of the amino acid 

20 sequence of the HIV-pol protease gene which codes for the 
active site responsible for proteolytic activity. 

6. A diagnostic kit for detecting antibodies to HIV 
antigens characterized in that said kit contains as a test 
reagent, a polypeptide as defined in Claim i, claim 2, 

25 Claim 3, Claim 4 or Claim 5. 

7. A vaccine for protecting an individual against Hiv 
infection comprising a polypeptide and a pharmaceutically 
acceptable carrier, characterized in that said polypeptide 
is as claimed in Claim 1, Claim 2, Claim 3, Claim 4 or 

30 Claim 5. 

8 . A polypeptide comprising a substantial portion of 
each of more than one of the enzymes coded for by the 
HIV-pol gene characterised by omitting at least that part of 
the amino acid sequence of the HIV-pol protease gene which 

35 codes for the active site responsible for proteolytic 
activity. 
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9. A polypeptide as claimed in Claim 8 characterized by 

comprising sequences of a plurality of enzymes selected from 
mv-pol protease, HIV-pol reverse transcriptase, HIV-pol 
RNase H and HIV-pol Integrase. 
5 10. A polypeptide according to Claim 9 characterized 

in that said polypeptide contains substantial portions of 
al.1 four of said enzymes. 

11. A polypeptide according to Claim 8 characterized in 
that said polypeptide has an amino acid sequence 
10 stzbstsuitially as shown in Table 3 beginning with the amino 
acid Met marked "AcNPV-HlvyKpol starts". 
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